# Localized deployment
Qwen Qwen3 8B GGUF
Apache-2.0
Quantized version of Qwen3-8B, quantized using the imatrix option of llama.cpp, suitable for text generation tasks.
Large Language Model
Q
bartowski
23.88k
18
Llama 4 Scout 17b 16e It Gguf
Other
An image-text to text conversion model built on the Meta Llama base model, supporting interaction through gguf-connector and llama-cpp-python.
Image-to-Text
L
chatpig
258
0
Mistral Small 3.1 24B Instruct 2503
Apache-2.0
Mistral Small 3.1 is a large multimodal language model with 24 billion parameters, possessing visual understanding ability and 128k long context processing ability, suitable for various tasks.
Image-to-Text Supports Multiple Languages
M
chutesai
2,035
0
Gemma 3 12b It Q8 0 GGUF
This model is converted from google/gemma-3-12b-it to GGUF format, suitable for the llama.cpp framework.
Large Language Model
G
NikolayKozloff
89
1
Gemma 3 1b It GGUF
The GGUF quantized version of the Gemma 3 1B model, suitable for text generation tasks.
Large Language Model
G
MaziyarPanahi
256.05k
4
Llama 3.2 1B Instruct GGUF
The GGUF format version of Llama-3.2-1B-Instruct, providing broader support and better performance.
Large Language Model
L
MaziyarPanahi
190.76k
12
Qwen2.5 1.5B Instruct GGUF
The GGUF format file of the Qwen2.5-1.5B-Instruct model, suitable for text generation tasks.
Large Language Model
Q
MaziyarPanahi
183.11k
6
Bielik 11B V2.3 Instruct GGUF
Apache-2.0
This is the GGUF quantized version of the Polish large language model Bielik-11B-v2.3-Instruct developed by SpeakLeash, suitable for local deployment and use.
Large Language Model
Transformers

B
speakleash
2,203
29
Phi 3 Mini 4k Instruct Q4 K M GGUF
MIT
This model was converted from microsoft/Phi-3-mini-4k-instruct to GGUF format using llama.cpp via ggml.ai's GGUF-my-repo space.
Large Language Model Supports Multiple Languages
P
matrixportal
67
3
Meta Llama 3.1 405B Instruct GGUF
Meta-Llama-3.1-405B-Instruct is a large language model with 405 billion parameters based on the Llama 3.1 architecture, optimized for instruction-following tasks and supporting multiple languages.
Large Language Model Supports Multiple Languages
M
MaziyarPanahi
189.43k
14
Gemma 2b It Q4 K M GGUF
The GGUF quantized version of the Gemma-2b-it model, suitable for local inference and supporting text generation tasks.
Large Language Model
Transformers

G
codegood
434
1
Breeze 7B Instruct V1 0
Apache-2.0
Breeze-7B-Instruct is a Traditional Chinese optimized language model based on Mistral-7B, specifically designed for instruction-following tasks, supporting scenarios such as Q&A and multi-turn dialogues.
Large Language Model
Transformers Supports Multiple Languages

B
MediaTek-Research
1,388
61
Chatmusician GGUF
MIT
ChatMusician-GGUF is a text generation model based on the GGUF format, suitable for music - related text generation tasks.
Large Language Model
Transformers English

C
MaziyarPanahi
315
13
Instagger
Apache-2.0
InsTagger is a tool for automatically providing instruction tags. It achieves its function by extracting tag results from InsTag and is mainly used to analyze large language model supervised fine-tuning data consistent with human preferences.
Large Language Model
Transformers English

I
OFA-Sys
2,303
22
Featured Recommended AI Models